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Abstract 

Background: Receptor-like kinases (RLKs) play key roles during development and in responses to the environment. 
Despite the relevance of the RLK family and the completion of the tomato genome sequencing, the tomato RLK family 
has not yet been characterized, and a framework for functional predictions of the members of the family is lacking. 

Results: To generate a complete list of all the members of the tomato RLK family, we performed a phylogenetic analysis 
using the Arabidopsis family as a template. A total of 647 RLKs were identified in the tomato genome, which were 
organized into the same subfamily clades as Arabidopsis RLKs. Only eight of 58 RLK subfamilies exhibited specific 
expansion/reduction compared to their Arabidopsis counterparts. We also characterized the LRRII-RLK family by 
phylogeny, genomic analysis, expression profile and interaction with the virulence factor from begomoviruses, the nuclear 
shuttle protein (NSP). The LRRII subfamily members from tomato and Arabidopsis were highly conserved in both 
sequence and structure. Nevertheless, the majority of the orthologous pairs did not display similar conservation in the 
gene expression profile, indicating that these orthologs may have diverged in function after speciation. Based on the fact 
that members of the Arabidopsis LRRII subfamily (AtNIKI, AtNIK2 and AtNIK3) interact with the begomovirus nuclear 
shuttle protein (NSP), we examined whether the tomato orthologs of NIK, BAK1 and NsAK genes interact with NSP of 
Tomato Yellow Spot Virus (ToYSV). The tomato orthologs of NSP interactors, SINIKs and SINsAK, interacted specifically with 
NSP in yeast and displayed an expression pattern consistent with the pattern of geminivirus infection. In addition to 
suggesting a functional analogy between these phylogenetically classified orthologs, these results expand our previous 
observation that NSP-NIK interactions are neither virus-specific nor host-specific. 

Conclusions: The tomato RLK superfamily is made-up of 647 proteins that form a monophyletic tree with the 
Arabidopsis RLKs and is divided into 58 subfamilies. Few subfamilies have undergone expansion/reduction, and only six 
proteins were lineage-specific. Therefore, the tomato RLK family shares functional and structural conservation with 
Arabidopsis. For the LRRII-RLK members SINIK1 and SINIKS, we observed functions analogous to those of their Arabidopsis 
counterparts with respect to protein-protein interactions and similar expression profiles, which predominated in tissues 
that support high efficiency of begomovirus infection. Therefore, NIK-mediated antiviral signaling is also likely to operate 
in tomato, suggesting that tomato NIKs may be good targets for engineering resistance against tomato-infecting 
begomoviruses. 
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Background 

Plant cells constantly react to multiple signals that come 
from the local environment, neighboring cells, or even 
from other organisms. Depending on the stimuli, plant 
cells may expand, divide, differentiate, synthesize com- 
pounds, prepare against pathogen infection, or induce 
necrosis [1]. To perceive and receive these signals, plant 
cells possess complex systems of transmembrane recep- 
tor proteins that facilitate communication between the 
intracellular environment and the outside world. 

One of the largest groups of these receptors is the 
receptor-like kinase (RLK) superfamily, which contains 
over 600 members in Arabidopsis [2-4]. RLKs are struc- 
turally organized into an extracellular domain that can 
be highly divergent, followed by a transmembrane seg- 
ment and a conserved intracellular serine/threonine kin- 
ase domain. Most RLKs are localized in the plasma 
membrane, although there are also RLK members that 
are found in the cytoplasm. In this case, RLKs do not 
possess either an extracellular region or a transmem- 
brane domain and are called receptor-like cytoplasmic 
kinases (RLCKs). Analyses of Arabidopsis RLKs by struc- 
tural comparison of their extracellular region and phylo- 
genetic analysis of their kinase domain revealed that 
they can be divided into over 50 subfamilies [5]. 

Several distinct RLKs have been studied in the past dec- 
ade, and a common theme that has emerged is that bind- 
ing of a specific signal molecule to their extracellular 
domain is required to initiate a signal transduction cascade 
[6]. Generally, ligand-receptor interactions at the extracel- 
lular domain of RLKs initiate the propagation of the signal 
through the membrane by inducing a conformational 
change in the receptor kinase domain, which allows inter- 
actions with other RLKs resulting in homo- or heterodi- 
mers. Dimerized RLKs are then transphosphorylated by 
their cytoplasmic kinase domain, leading to both activation 
of the kinase and establishment of docking sites for phos- 
phorylation of downstream phosphorylation targets [7,8]. 
This activation mechanism of plant RLKs is similar to that 
of signal transduction mediated by receptor tyrosine 
kinases in animal cells, which share a common origin with 
plant serine/threonine kinases [3]. 

Functional analysis of RLKs indicates that the majority 
of them are associated with plant development or defense 
response, but there are also RLKs involved in cell wall at- 
tachment (extensin, proline-rich extensin and lectin 
RLKs), plant-bacterial symbiotic interactions (LysM RLKs) 
and self-incompatibility (S-domain containing RLKs). 
Among all RLKs, those bearing a leucine-rich repeat 
(LRR) domains are overrepresented in the RLK superfam- 
ily, comprising over 38% of Arabidopsis RLKs, which are 
distributed into 15 subgroups (LRR I to LRR XV). The 
LRR domains in these receptors vary in number (from one 
to 25) and in the distribution pattern of the LRRs along 



the extracellular region. Examples of well-known LRR- 
RLKs include CLAVATA1, which controls the size of stem 
cells in the apical meristem by forming a heterodimer with 
CLAVATA2 and then interacting with CLAVATA3 through 
the extracellular domain [9], and BRASSINOSTEROID 
INSENSITIVE-1 (BRI1) [10,11], which perceives brassinos- 
teroids and interacts with it receptor partner, BAK1 
(BRI1-ASSOCIATED KINASE-1) [12,13]. Other functions 
associated with LRR-RLKs include morphogenesis [14-20], 
embryogenesis [21-24], pollen self-incompatibility [25] and 
responses to environmental signals [26]. In addition, some 
LRR-RLKs are known to function as regulators of defense 
response to bacterial pathogen [27-29], necrotrophic fungus 
[30] and viral infection [31,32]. 

Most of the characterized RLKs are from model plants 
such as Arabidopsis and Medicago truncatula, but signifi- 
cant efforts have been made to expand these studies to 
relevant field crops. Large-scale comparative analyses of 
Arabidopsis RLKs with rice [5,33,34] and soybean [35] 
RLKs identified over 1000 kinase proteins in rice and 600 
in soybean belonging to the RLK superfamily; almost all 
members were grouped into previously determined Arabi- 
dopsis RLK subfamilies. The RLK subfamilies with devel- 
opmental function have conserved size, whereas those 
involved in defense response have expanded their mem- 
bers, mainly by tandem duplication [5]. 

Although tomato (Solanum lycopersicum) is one of the 
most consumed and cultivated field crops in the world, a 
large-scale phylogenetic analysis of tomato RLKs has not 
yet been performed, and few members of the tomato RLK/ 
Pelle family (RLKs + RLCKs) have been studied and 
characterized. These members include Pto [36], Ptil (Pto- 
INTERACTING 1) [37], and Bti9 (AvrPtoB-TOMATO 
INTERACTING PROTEIN 9) [38], which interact with 
Pseudomonas syringae elicitors; TARK1 (TOMATO ATYP- 
ICAL RECEPTOR KINASE-1) [39], which interacts with 
the Xanthomonas campestris elicitor; TPKlb (TOMATO 
PROTEIN KINASE 1) [40], whose expression is induced by 
mechanical wounding and oxidative stress; and SR160 
(SYSTEMIN RECEPTOR) [41], which is the AtBRIl ortho- 
log and binds to systemin to respond to wounding or herbi- 
vore attack, although there is some debate about the 
function of this receptor [42], Another well-studied RLK in 
tomato is NIK (NSP-INTERACTING KINASE), which 
interacts with nuclear shuttle protein (NSP) of geminivirus 
during infection [43]. Three homologs of NIK in Arabidop- 
sis (AtNIKl, AtNIK2 and AtNIK3) have also been shown 
to interact with NSP through their kinase domain [31]. This 
interaction causes inhibition of the kinase activity of NIKs 
and hence prevents the activation of the signal transduction 
cascade that evokes a plant defense response [32]. These 
RLKs are members of the LRRII subfamily that also 
contains the SOMATIC EMBRYOGENESIS RECEPTOR 
KINASES (SERKs) [44]. 
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With the completion of the tomato genome sequen- 
cing along with the annotation of the encoded proteins 
[45], it has become possible to study the RLK superfam- 
ily in this species using a large-scale phylogenetic ap- 
proach. According to genomic analyses, the tomato 
genome was predicted to have approximately 900 mega- 
bases of DNA and encode 34,727 proteins. In this inves- 
tigation, we identified and classified all putative tomato 
RLKs by comparison with previously described Arabi- 
dopsis RLKs [5]. We also showed that the tomato RLK 
members of LRRII subfamily, which comprises NIK and 
SERK genes, share similar biochemical activity (capacity 
to interact with the geminivirus NSP), genomic structure 
and partial overlapping expression profiles with the Ara- 
bidopsis orthologs. Our results provide a framework for 
understanding RLK function in tomato and reveal that 
some tomato and Arabidopsis LRRII-RLK orthologs may 
play similar roles in antiviral defense. 

Results 

The tomato RLK superfamily 

The identification of the RLK superfamily members in 
tomato was initially performed by a batch BLAST search 
against a tomato protein database (ITAG v2.3, available 
in solgenomics.net) using the kinase sequences of repre- 
sentative Arabidopsis RLKs as queries. This analysis 
retrieved 955 tomato proteins that seemed to be RLKs. 
All of these retrieved tomato proteins were submitted 
for annotation of their domain structure using SMART 
[46] (smart.embl-heidelberg.de) and Pfam [47] (pfam. 
sanger.ac.uk) databases. Four proteins that did not bear 
a kinase domain were not considered for further ana- 
lysis. The remaining 951 proteins were used for phylo- 
genetic analysis based on their kinase domain sequences. 
For this analysis, we included all Arabidopsis RLKs to 
compare with tomato RLKs and used representative 
proteins of other kinase families of Arabidopsis and 
human as outgroups (Additional file 1). All Arabidopsis 
RLKs were placed in a major cluster together with 
647 tomato proteins that were identified as members 
of the RLK superfamily (Figure 1). The other 304 pro- 
teins were clustered with outgroups; consequently, they 
were not considered to be members of RLK superfamily 
(Additional file 2). 

The size of the tomato RLK superfamily (647 RLKs) 
was similar to that of the Arabidopsis RLK superfamily 
(623 RLKs). Furthermore, almost all tomato RLKs (631 
RLKs) were clustered with at least one Arabidopsis RLK. 
Therefore, the tomato RLK superfamily was divided into 
the same 58 subfamilies as described previously for Ara- 
bidopsis [5]. As in Arabidopsis, in which 236 out of all 
623 RLKs belong to leucine-rich repeat (LRR) subfamilies, 
tomato LRR subfamilies were the most abundant and con- 
tained 257 proteins. Another large RLK subfamily was 



RLCK, which included 128 members in tomato, almost 
the same number as in Arabidopsis (150). Among the 16 
tomato RLKs that were not clustered in the same branches 
as Arabidopsis RLKs, ten proteins were quite small and 
lacked a typical RLK structure, but the other six proteins 
had a clear RLK structure and as such were considered to 
be tomato-specific RLKs. Among those six tomato-specific 
RLKs, Solyc03g080060 contained a legume lectin do- 
main similar to members of the lectin subfamily, and 
Solyc02g083410 harbored an amino oxidase domain 
(flavin containing amine oxidoreductase activity), which 
is not found in any Arabidopsis RLKs. The remaining 
four proteins did not have any predicted protein 
domains in their extracellular region. RLK superfamily 
profiles in both species are summarized in Figure 2. 

Although the RLK superfamilies of Arabidopsis and 
tomato share common features, a close inspection 
reveals some interesting differences between them. A 
comparison of membership size of each subfamily 
revealed some differences between the species. To infer 
the number of RLKs in their common ancestral and the 
occurrence of duplication/deletion events after the diver- 
gence of both species, we used a reconciliation method 
[48] to compare the RLK superfamily tree (Figure 1 and 
Additional file 2) with a species tree generated at NCBI 
taxonomy browser (Additional file 3). Then, the fre- 
quency of duplication or deletion events that occurred 
in each RLK subfamily was statistically analyzed. We 
identified twelve RLK subfamilies from these plant spe- 
cies that have significantly expanded/reduced in their 
size (Additional files 3 and 4, test of equal or given pro- 
portion, p< 0.05). In tomato, the LRK10L2, SD1, SD2b 
and LRRXII subfamilies displayed significant number 
of duplication, while in Arabidopsis, expansion was 
observed in DUF26, L-LEC, LRK10L2, SD1, WAK, LRRIa 
and RLCKXII/XIII subfamilies. Conversely, LysMII in 
Arabidopsis and LRRIa and RLCKXII in tomato 
reduced in size. Another distinct aspect of RLKs in 
these plant species refers to the lack of a common do- 
main structure in the extracellular region of RLK 
members of the LRK10L-2 subfamily. Whereas the 
extracellular region of Arabidopsis LRK10L-2 members 
harbors diverse structures, such as thaumatin, glycero- 
phosphoryl diester phosphodiesterase family (GDPD) 
or malectin domains, the tomato RLKs of this same 
subfamily do not contain any predicted domain struc- 
ture in their N-terminal region. 

Further analyses were performed to predict function 
associated with expansion/reduction patterns. As RLKs 
are frequently associated to defense or developmental 
processes, we performed a search using the Gene Ontol- 
ogy (GO) terms [49] for functionally annotated RLKs in 
those categories (Additional file 3) and statistically com- 
pared the proportion of annotated genes in each RLK 
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Figure 1 The tomato RLK superfamily is composed of 647 proteins. Phylogenetic tree constructed by sequence alignment of kinase domain 
of Arabidopsis RLKs together with putative tomato RLKs. The alignment was carried out with CLUSTALW, and the phylogenetic tree 
reconstruction was made using FastTree. Almost all tomato RLKs (red branches) clustered with Arabidopsis RLKs (blue branches). Color ranges 
delimit the RLK subfamilies. LRR subfamilies (light green) are subdivided in 15 groups, and each group is identified in the tree with Roman 
numerals (I to XV). 



subfamily with the proportion of annotated genes in the 
whole RLK superfamily (see Methods for more details). 
Compelling evidence in the literature has demonstrated 
that defense- related genes have high duplication rate 
and are organized in tandem repeats [34,50,51]. We also 
identified the RLKs organized in tandem repeats and 
determined their frequency in each RLK subfamilies 
(Additional file 3). These analyses demonstrated that 
most of the RLK subfamilies, which expanded after Ara- 
bidopsis and tomato species divergence, had their genes 
organized in tandem repeats. Functional annotation ana- 
lysis of their genes revealed that the GO terms of 42 out 



of 214 genes in tandem repeats were associated with 
defense response. RLKs annotated as developmental- 
related were overrepresented in CrRLKl-1, PERK, 
LRRVIIa, LRRXI and LRRXIII subfamilies and all those 
subfamilies did not expand or present genes in tandem 
arrays. The subfamilies that underwent reduction in their 
size were not associated with defense- or development- 
related functions, except for the LRRIa subfamily, which 
may be related to defense response. Consistent with the 
involvement of members of the LRRII subfamily in 
defense and development, the LRRII subfamily of tomato 
and Arabidopsis had significantly high number of 
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Figure 2 The number of members varies in some tomato and 
Arabidopsis RLK subfamilies. The distribution profile of tomato 
and Arabidopsis RLKs in subfamilies (gray bar) and the estimated 
number of RLK in their common ancestral (black bar) are presented. 
Almost all RLK subfamilies described in Arabidopsis have 
representatives in tomato. 



annotated members in either defense or developmental 
categories. 

Motif prediction, genomic structure and phylogenetic 
analysis of the LRRII subfamily 

Compelling evidence in the literature has revealed a fun- 
damental role for members of the Arabidopsis LRRII- RLK 
subfamily as co-receptors for transducing developmen- 
tal and defense signals [52-54]. The potential of the 
members of this subfamily as co- receptors involved in 
the activation of RLK-mediated signal transduction 
prompted us to perform a comprehensive analysis of the 
tomato LRRII-RLK subfamily to uncover related func- 
tions in tomatoes. Based on the phylogenetic tree of all 
members of RLK superfamily, the tomato LRRII-RLK 
subfamily encompassed 13 proteins. The members of 
this group from both plant species have over 600 amino 
acids on average. Phylogenetic analysis of this group 
using full-length protein sequences resulted in a tree 
with three well-resolved clusters; the tomato and 
Arabidopsis proteins were found in all clusters, although 
they had distinct sizes (Figure 3A). These clades were 
termed NIK, SERK and LRRIIc based on annotation of the 
Arabidopsis members in each cluster. The NIK clade is 
formed by seven tomato members and six Arabidopsis 
members, including the three AtNIK genes. The SERK 
clade clustered the five well- characterized SERKs in 
Arabidopsis and three members of the tomato subfamily. 
The LRRIIc clade consisted of three tomato proteins and 
three Arabidopsis proteins whose functions are unknown. 
Motif prediction analysis on these proteins revealed that 
tomato and Arabidopsis LRRII-RLK members display 
similar protein domains organized in the same fashion 
(Figure 4). The consensus structural organization of the 
conserved domains between both species included an 
N-terminal signal peptide followed by a leucine zipper, 
five LRRs at the extracellular side and a transmembrane 
domain separating the N-terminal portion from the 
cytoplasmic C-terminal kinase domain. Among the 
SERK genes of both species, there was also a proline- 
rich domain (SPP) localized between the last LRR and 
the transmembrane domain (Figure 4). Sequence align- 
ment of the LRRII RLKs showed several conserved 
amino acid positions among members of both plant spe- 
cies. Exon/intron boundaries were also well conserved. 
Variation at the sequence level was observed within the 
SPP and signal peptide recognition domains. Among the 
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LRRII clades, the proteins comprising the SERK clade 
were more conserved with a larger number of conserved 
positions compared with the predicted proteins of the 
LRRIIc and NIK clades (Figure 4). Genomic structure 
analysis revealed that in general LRRII genes are 
organized into 11 exons (Figure 5A). Genes that varied in 
this number displayed fused exons, including AtNIKl, 
At5gl0290.1 and SINIK3, or had deleted exons, such as 
At5g63710.1, Solyc02g072310.2.1 and Solyc05g005 140.2.1. 
Intronic regions were larger in tomato members than in 
Arabidopsis members and in SERK genes compared with 
genes from other clades (Figure 5B). 

Expression analysis of LRRII subfamily genes in different 
tissues 

We examined the expression profiles of LRRII-RLK genes 
in different tomato tissues, including leaf, stem, root, 
flower, cotyledon and hypocotyl, by real-time PCR. The 
results are presented in Figure 6 and summarized in 
Figure 3B. Almost all analyzed genes exhibited expression 
in at least one organ, except for Solyc05g005 140.2.1 and 
Solyc02g072310.2.1 in the NIK clade, which had very low 
expression in all tissues tested. The tomato genes in the 
SERK clade were expressed more highly in leaf and cotyle- 
don tissue than in stem or flower tissue. The NIK clade 
genes were highly expressed in diverse organs, such as 
leaves, flowers and roots. The LRRIIc group encompassed 
genes with a higher level of expression in cotyledon, flower 
and leaf tissues and with lower expression in stem tissue. 

The expression data for LRRII subfamily members from 
Arabidopsis (Figure 3C) were extracted from AtGenEx- 
press [55] to examine whether there is some correspond- 
ence in the expression profiles between orthologous pairs 
of tomato and Arabidopsis genes. Statistical analyses of 
correlation between tomato and Arabidopsis expression 
data could not be performed because these data have been 
generated by different methods (qRT-PCR and micro- 
array) and hence they have different units. Nevertheless, a 
subjective comparison of the expression analysis from 
both plants revealed that the majority of the orthologous 
genes displayed partially but not entirely overlapping ex- 
pression profiles (Figure 3D). The orthologous groups that 
presented similar expression profile were AtSERKll 
AtSERK2 and Solyc04g072570, which had high expression 
levels in leaves, AtNIKl and SINIK1, which were lowly 
expressed in stem and cotyledon tissues, and AtNIK3 and 
SINIK3, which were most highly expressed in the leaf. 

Interactions between representatives of the LRRII 
subfamily and NSP of ToYSV 

We have previously shown that NSP from begomovirus 
interacts with members of the LRRII-RLK subfamily, 
such as AtNIKl, AtNIK2 and AtNIK3, to suppress host 
defense, and it interacts with a member of the PERK-like 



RLK subfamily, NSP-ASSOCIATED KINASE (NsAK), to 
potentiate virus infection [31,56]. Either NIK from to- 
mato and NsAK from Arabidopsis were isolated by their 
capacity to interact with NSP through two-hybrid screen- 
ing [43,56]. The NSP interactions with the Arabidopsis 
AtNIKl, AtNIK2 and AtNIK3 and NsAK were further 
confirmed by yeast-two hybrid and in vitro pull down 
assays [31,56]. We have recently shown by bimolecular 
fluorescence complementation (BiFC) assay that NSP also 
interacts with NIK in vivo. Because begomovirus negatively 
impacts tomato cultivation worldwide, we selected 
representatives of tomato RLKs from the LRRII subfamily 
and examined their capability to interact with NSP of 
Tomato Yellow Spot Virus (ToYSV) similar to the 
interaction observed with Arabidopsis NIK genes [31]. 
Yeast two-hybrid experiments were performed using the 
ToYSV-NSP (accession number: YP_459917.1) as prey 
and kinase domains of S1NIK1 (Solyc02g089550), S1NIK2 
(Solyc04g005910), S1NIK3 (Solyc04g039730) and S1BAK1 
(Solycl0g047140) as bait. We also analyzed a PERK repre- 
sentative (Solycl2g007110, SINsAK) that is similar to the 
NSP-interactor PERK-like gene of Arabidopsis (At5g24550, 
AtNsAK). A tomato gene (Solyc03g019980) from the 
LRRXII subfamily, homolog of the Arabidopsis EF-Tu re- 
ceptor (AtEFR), was used as a negative control. Interactions 
between the viral NSP and host proteins were detected 
after co-transforming the yeast cells with both bait and prey 
plasmids and monitoring for histidine prototrophy. NSP 
was found to interact with the kinase domains of S1NIK1, 
S1NIK2, S1NIK3 and S1BAK1 or with the kinase domain of 
the PERK representative SINsAK (Figure 7, upper panel). 
The NSP interactions were specific to the tomato LRRII- 
RLK orthologs and to PERK-like SINsAK because the HIS 
marker gene was not activated in yeast cells co-transformed 
with TYNSP-p22 (pAD-NSP) and with either the empty 
vector or SlEFR-p32 (EFR kinase domain). Furthermore, 
co-transformation of yeast with the NSP interactors fused 
to the GAL4-binding domain and the empty vector expres- 
sing the GAL4-activating domain alone also failed to acti- 
vate the HIS marker gene (Figure 7, lower panel). These 
results expanded our previous observation that NSP-NIK 
complex formation was neither virus -specific nor host- 
specific [31,43]. They also suggest that SINsAK is a NSP 
target during begomovirus infection in tomato. Certainly, 
the in vivo demonstration of these interactions will further 
support these interpretations. 

Discussion 

The structure of the tomato RLK superfamily and the 
proposed evolution of RLK superfamily in plants 

To date, the phylogenetic and structural characterization 
of the RLK superfamily has been limited to the following 
plant species: moss, rice, poplar, soybean and Arabidopsis 
[5,34,35]. The size of these families ranges from 300 to 
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Figure 3 Phylogenetic and expression analysis of LRRII subfamily members. (A) Phylogenetic tree reconstructed by the maximum 
likelihood method (JTT+G+I, bootstrap replicates = 1000) of the LRRII subfamily. Members of this subfamily can be separated in three 
well-supported clades, referred to here as SERK, NIK and LRRIIc clades. Expression analysis of LRRII subfamily members in (B) tomato and (C) 
Arabidopsis in different plant tissues. The expression data of tomato and Arabidopsis members were obtained by qRT-PCR and from normalized 
data from the AtGenExpress database [55], respectively. No expression data were obtained for AtSERK5 (At2g 13800). (D) Tissues with high 
mean-expression are summarized for each gene. Orthologous genes that had similar expression profiles are delimited by brackets. 



1200 proteins (Figure 8), and their extracellular regions 
bear a great variety of protein domain structures. In the 
present investigation, we characterized and generated a 
complete list of the tomato RLK superfamily members 
(Additional files 2 and 5), identifying 647 RLKs, which falls 
in the size range of the Arabidopsis (623 RLKs) and soy- 
bean (605 RLKs) superfamilies. 

Evolutionary analyses of the RLK superfamily has sug- 
gested that the RLK structure was established prior to the 
divergence of land plants from algae because proteins with 
RLK configurations were discovered in the unicellular 
algae Chlamydomonas reinhardtii [34]. Comparative ana- 
lysis of RLKs among moss, rice, poplar and Arabidopsis 
revealed that the RLK superfamily underwent expansion 
in the beginning of the land plant lineage, after the diver- 
gence of angiosperm and bryophyte and independently 
during diversification of each angiosperm lineage. The 
most dramatic expansion was observed in the rice and 
poplar lineages, which have almost twice as many as RLK 



members as Arabidopsis [34]. This evolutionary scenario 
has not been changed by inclusion of data regarding the 
soybean [35] and tomato (this work) RLK superfamily ex- 
pansion (Figure 8). 

The phylogenetic tree of the members of the RLK 
superfamilies in tomato and Arabidopsis revealed that 
most of the RLK subfamilies have maintained approxi- 
mately the same number of RLK members between these 
species. Exceptions were observed for the DUF26, L-LEC, 
LRK10L2, SD1, SD2b, WAK, LRRIa, LRRXII, LRRXIIb, 
RLCKXII/XIII subfamilies, in which specific and extensive 
expansion was observed in one of the two plant species, as 
well as for the LysMII, LRRIa and RLCKXII, in which spe- 
cific reduction was observed (Additional file 3). Functional 
annotations of some Arabidopsis RLKs and the number of 
genes in tandem repeat that compose those subfamilies 
indicated a predominance of genes clustered in tandem 
array and defense-related RLKs (Additional file 3). Some of 
those subfamilies, such as DUF26, L-LEC, SD1, SD2b, 
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Figure 4 (See legend on next page.) 
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Figure 4 Full-length sequence alignment of LRRII subfamily members of tomato and Arabidopsis demonstrate sequence and structure 
conservation. Sequences of SERK, NIK and LRRIIc clades members are represented with blue, black and red letters, respectively. Yellow sites 
represent conserved sites in all sequences, and green sites represent conserved sites in each clade. Red sites represent the exon-exon junctions. 
Domain structures are indicated above the alignment. Roman numerals delimit the 1 1 subdomains of the kinase domain. 
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WAK and LRRIa, had been shown previously to be overre- 
presented in microarray analysis of Arabidopsis under dif- 
ferent stress conditions [34]. Taken together, these results 
are consistent with the previous assumption that specific 
expansions of RLK genes have occurred more frequently 
for those RLKs associated with defense response. 



Specific expansion of tomato RLKs compared to Ara- 
bidopsis occurred in the LRK10L2, LRRXII, SD1 and 
SD2b subfamilies. Interestingly, all of those RLK subfam- 
ilies, except for SD1, were also overrepresented in rice 
when compared with Arabidopsis [5]. The LRRXII sub- 
family comprises the EF-Tu RECEPTOR (EFR) and 
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Figure 5 Genomic structure analysis of members of the LRRII subfamily of tomato and Arabidopsis. (A) Exons are shown as dark boxes 
and introns as grey lines. Green boxes represent fused exons. The number between parentheses represents the number of exons in each gene. 
Almost all genes contain 1 1 exons. (B) Boxplot illustrating the distribution of intron length among LRRII clades of Arabidopsis and tomato. The 
red line marks the average intron length. Note the large length of intronic regions in SERK genes compared with genes from other clades, and in 
tomato sequences compared with Arabidopsis sequences. 
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Figure 6 Expression analyses of tomato members of the LRRII subfamily in various plant organs by qRT-PCR. Expression of each gene 
was quantified using SIAPT1 as an endogenous control. Bars represent the mean expression from three biological samples and two replicates, 
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FLAGELLIN-SENSITIVE 2 (FLS2) in Arabidopsis, and 
Xa21 in rice, all of which are associated with defense 
responses. The expansion of the LRRXII subfamily in cul- 
tivated plants such as tomato and rice has previously been 
suggested to be associated with the accumulation of resist- 
ance genes by intense breeding programs [5]. Likewise, the 
SD subfamily also has representatives involved in defense 
response, such as RK1 and RK3 [57], but this subfamily is 
also strongly associated with the self-incompatibility 
process [58]. Arabidopsis is known to be self-compatible, 
while rice, tomato and the close relative of A. thaliana, A. 
lyrata, are self-incompatible. Thus, the specific deletion of 
some representatives in A. thaliana could have contribu- 
ted to the generation of the self-compatible mode in this 
species. In contrast, some RLK subfamilies were expanded 
specifically in Arabidopsis. Those include DUF26, L-LEC, 
LRK10L2, SD1, WAK, LRRIa and RLCKXII/XIII. Except 
for SD1 and LRK10L2 subfamilies, no significant 
expansion were observed in these RLK subfamilies in to- 
mato, although all of them contain RLKs that are involved 
in defense response, such as FLS2-INDUCED RECEP- 
TOR-KINASE 1 (FRK1, LRRIa) [59], LECTIN RECEPTOR 
KINASE 1.9 (LecRK-1.9, L-LEC) [60], PATHOGENESIS - 
RELATED 5-LIKE RECEPTOR KINASE (PR5K, LRK10L2) 
[61], RESISTANT TO FUSARIUM OXYSPORIUM 1 
(RFOl, WAK) [62] and CYSTEINE-RICH RLK 5 (CRK5, 
DUF26) [63]. 

Another relevant distinction between tomato and Ara- 
bidopsis RLK subfamilies derives from the diversification 
of the extracellular domain patterns of the LRK10L2 
subfamily representatives. Arabidopsis members of the 
LRK10L2 subfamily have unique domain structures, 
such as GDPD, thaumatin and malectin domains, while 



the tomato members do not possess any characterized 
domain structure in their extracellular region. Likewise, 
these domain structures have not been found in rice, 
poplar or moss RLKs, indicating that within the RLK 
superfamily they are specific to Arabidopsis. We also 
identified a tomato-specific RLK that possesses an amino 
oxidase domain in its extracellular region. These RLKs 
may respond to molecular signals not perceived by other 
plants. Although gaining a novel protein structure could 
increase the repertoire of signals perceived by plants, the 
small number of lineage-specific RLKs in tomato, as was 
also reported in rice and poplar, further substantiates the 
hypothesis that the expansion of existing RLK kinase 
subfamilies is the major mechanism of evolution of these 
proteins. 

The members of the RLK superfamily are involved in di- 
verse biological processes at all steps of plant development. 
Thus, the gain or loss of a RLK gene could have serious 
repercussions on plant phenotype. The specific profiles of 
the RLK superfamily found in tomato and Arabidopsis are 
certainly responsible for several differences between these 
plants, such as morphology, reproduction and, import- 
antly, responsiveness to different stress conditions. Among 
tomato and Arabidopsis, few RLK subfamilies have under- 
gone specific expansion or reduction after their speciation. 
This scarcity may indicate that variation in RLK superfam- 
ily profiles in both plants appeared recently. The domesti- 
cation process that tomatoes underwent could have been a 
significant factor contributing to this variation because 
some RLKs have been directly linked to traits targeted by 
artificial selection, such as disease resistance and growth. 
Nevertheless, most of the specific expansions of RLK sub- 
families observed in Arabidopsis were also associated with 
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defense response and had occurred at similar rates (287 
and 259 duplication events in Arabidopsis and tomato, re- 
spectively). These data suggest that, independently of arti- 
ficial selection, Arabidopsis had also expanded and 
developed a specific machinery against abiotic or biotic 
stress response, which argues against the assumption that 
artificial selection leads to resistant genes accumulation. 
To further examine the influence of artificial selection on 
the repertoire of plant RLKs, new genetic resources for 
closely related wild plants are necessary. 

Functional expression analysis of the LRR II subfamily 
members in tomato and Arabidopsis 

The LRRII subfamily contains RLKs with dual functions in 
development and defense response [52-54]. Characterized 
members of this subfamily include (i) SERK genes, which 
are associated with diverse processes, such as brassinoster- 
oid signaling, flagellin, cell death, light and pathogen- 
associated molecular pattern (PAMP) responses [53], and 
(ii) NIK genes, which interact directly with geminivirus 
NSP during viral infection [31,32]. Phylogenetic and pro- 
tein structure analyses on LRRII subfamily members of to- 
mato and Arabidopsis demonstrated that this group is 
highly conserved between these species. In rice, in which 
the RLK superfamily has undergone a large expansion, the 
LRRII subfamily members are also conserved in number 
and sequence, indicating that biochemical pathways regu- 
lated by LRRII-RLKs have essential and conserved roles in 
angiosperm species. 

Although LRRII members have well-conserved amino 
acid sequences among various species [64], expression 
analysis of the members of the tomato and Arabidopsis 
LRRII subfamilies demonstrated that only a few of the 
orthologous pairs resemble in their expression profiles. By 
analogy with some evidence in the literature from other 
plant species, one may envision that these orthologous 
genes could have functionally diverged after the speciation 
event separating tomato and Arabidopsis. Functional di- 
vergence in orthologous genes is not an uncommon event 
in both plants [65,66] and animals [67,68]. For example, 
the CRABS CLAW transcription factor in Arabidopsis is 
expressed in the carpel primordial abaxial region and in 
floral nectarines and regulates carpel morphology and 
nectar development, whereas its orthologous in rice, 
DROOPING LEAF (DL), is expressed in the whole carpel 
primordium and in central undifferentiated cells of leaves, 
where it regulates carpel identity and midrib development 
[66]. The expression of orthologous genes has also been 
shown to vary differently in response to a stress condition. 
In barley, which is tolerant to salinity, the expression of 
genes involved in root development, such as CONSTANS- 
LIKE 3 (COL3), is suppressed by high salinity, whereas the 
expression of the rice orthologous is unchanged under the 



same stress condition [65]. Likewise, a large fraction of 
orthologous pairs of rice and Arabidopsis genes with recep- 
tor activity do not display conserved co-expression [69]. 
Therefore, different patterns of expression between the 
orthologous genes in the LRRII subfamilies from tomato 
and Arabidopsis may be a result of functional divergence 
that occurred between these genes. Functional divergence 
in receptor proteins with a developmental function may 
lead to a dramatic change in the plant phenotype because 
plant development is heavily guided by external signals. For 
example, a tissue that displays high expression of certain 
RLKs is likely to be more sensitive to perception of RLK- 
specific sensing signals, leading to a rapid and effective re- 
sponse. In contrast, reduced expression of an orthologous 
gene from a different species in the same tissue would de- 
crease the effectiveness and delay the signal perception and 
response. This difference in the cell responsiveness to a 
specific signal could represent the differential timing of 
biochemical reactions that are regulated by this signal. In 
developmental process, small differences in reaction time 
may be sufficient to generate a distinct phenotype in the 
plant. In contrast, the orthologous pairs SERK1/SERK2/ 
Solyc04g072570.2.1, which display similar expression pro- 
files (Figure 3), also contain the most conserved extracellu- 
lar and intracellular domains (approximately 80% and 93% 
of sequence identity, respectively). The other two ortholo- 
gous pairs, NIK1/S1NIK1 and NIK3/S1NIK3, which also 
displayed similar expression profiles, also had highly con- 
served extracellular and intracellular regions. In both 
orthologous pairs, sequence identity was approximately 
65% in the extracellular regions and approximately 80% in 
the intracellular regions (Additional file 6). This finding 
may indicate a tight conservation of function between 
these members of the Arabidopsis and tomato LRRII RLK 
subfamilies. 



Conservation of geminivirus interactions with members 
of the RLK family in tomato 

Although most of LRRII subfamily orthologous pairs 
exhibited functional divergence, we showed that the to- 
mato orthologs of the LRRII-RLKs members NIK1, 
NIK2 and NIK3 retain the capacity to interact with 
geminivirus NSP in yeast (Figure 7) [31]. At least for the 
NIK1 and NIK3 ortholog pairs, the functional conserva- 
tion associated with specific protein-protein interactions 
may be linked to the high conservation of their NSP- 
interacting kinase domain (approximately 80% sequence 
identity, Additional file 6) and similarity of expression 
profiles (Figure 3). The current model of NIK-mediated 
defense response posits that the immune receptor pro- 
tects plant against geminiviruses by phosphorylating the 
ribosomal protein L10 (rpLlO) [32,70]. Phosphorylation 
of rpLlO by NIK redirects the ribosomal protein to the 
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nucleus, where it may mount a defense mechanism to 
prevent viral proliferation. During geminivirus infection, 
NSP interacts with the kinase domain of NIKs to inhibit 
their kinase activity, preventing activation of the defense 
response. Despite the high similarity between NIK genes 
and SERK genes, AtBAKl/SERK3 and AtSERKl do not 
functionally replace the AtNIKl role in transducing an 
antiviral signaling response and do not interact with the 
viral NSP [31,32]. In contrast, we found that the AtBAKl 
ortholog from tomato interacts with NSP in yeast. Al- 
though the functional relevance of this interaction in 
planta remains to be determined, it is worth noting that 
the expression profiles of the BAK1 orthologs are not 
similar, as would be expected for functionally divergent 
orthologs. Although both orthologs are ubiquitously 
expressed in the cognate plant species, they are expressed 
to different extents in distinct organs. Whereas AtBAKl 
expression is quantitatively similar and relatively low in all 
organs analyzed, its ortholog from tomato displays a 
higher level of expression in the cotyledons, hypocotyls 
and leaves, where geminivirus infection largely takes place. 
Therefore, the expression profiles of the NSP interactors 
(SINIKs and S1BAK1) seemed to be related with the onset 
of geminivirus infection. Due to the high expression of the 
AtBAKl tomato orthologous in leaves, one may envision 
the existence of evolving selective pressures to diverge the 
corresponding NSP-interacting domains of the BAK1 
orthologs towards functional fitness with regard to gemi- 
nivirus infection. 

In contrast to NIK receptors, which are inhibited by 
NSP interaction, AtNsAK, a member of PERK subfamily, 
interacts with NSP and phosphorylates the viral protein 
in vitro [56]. Loss of nsak function enhances tolerance to 
geminivirus infection, indicating that AtNsAK is a posi- 
tive contributor to geminivirus infection in Arabidopsis. 
Here, we showed that the NsAK tomato orthologous 
retains its capacity to interact with viral NSP. This 
demonstrates that specific members of the RLK family 
have conserved defense functions (such as NIKs) or 
compatibility functions (such as NsAK) in response to 
viral infection. Due to the emergence of new species of 
tomato-infecting begomoviruses that rapidly evolve 
through recombination or pseudo-recombination to pro- 
duce divergent genome sequences that gives the virus an 
advantage over its hosts recognition system, a survey of 
the interactions between NSPs from distinct tomato- 
infecting geminiviruses and SINIKs and SINsAK may add 
insights into the co-evolution of the viral protein and host 
defense/compatibility functions. 

Conclusions 

The RLK superfamily is a large and diverse group of trans- 
membrane receptors that enables plants to perceive a 



diverse array of signals at the cell surface, creating an effi- 
cient mechanism for cell-environment communication. In 
this investigation, we generated a complete list of the 
members of the tomato RLK superfamily, which is made- 
up of 647 proteins. The tomato RLK sequences exhibited 
a typical receptor-like kinase configuration and almost all 
of them were phylogenetically clustered with at least one 
member of the Arabidopsis RLK superfamily. Therefore, 
the tomato RLK superfamily is similarly organized, with 
the same number and identity of subfamilies as previously 
defined for Arabidopsis RLKs. Among the 58 RLK sub- 
families, twelve showed specific and extensive expansion 
or reduction in the number of their RLK members, which 
may be a reflection of lineage -specific responses to various 
biotic and abiotic stresses. The intense breeding programs 
tomatoes have been subjected to may also have contribu- 
ted to the establishment of the current RLK superfamily 
profile in this species. This comprehensive analysis com- 
paring the complete repertory of Arabidopsis and tomato 
RLKs may provide a framework to rationalize future func- 
tional studies of the members of this family. 

Phylogenetic and structural analyses of LRRII subfam- 
ily members from both tomato and Arabidopsis reveal a 
well-conserved group both in terms of sequence and 
protein domain organization. As a consequence, the to- 
mato LRRII-RLK subfamily is organized into the same 
three with phylogenetically supported clades, SERK, NIK 
and LRRIIc clusters. Nevertheless, a comparison of the 
expression between orthologous genes of this subfamily 
demonstrated that the majority of the orthologous pairs 
did not share a similar expression profile, indicating that 
these orthologous LRRII-RLKs may have undergone 
functional divergence. This finding is supported by the 
observation that, in contrast to the Arabidopsis AtBAKl, 
S1BAK1 interacts with the geminivirus NSP and is highly 
expressed in leaves and the cotyledon. This pattern of 
S1BAK1 expression is consistent with the pattern of in- 
fection by tomato-infecting begomoviruses, which infect 
leaf tissues and move through the phloem but do not 
invade roots. Additionally, as immune receptors, the 
orthologous pairs NIK1 and NIK3 displayed both the 
capacity to interact with the begomovirus virulence fac- 
tor NSP and expression profiles that parallel the onset of 
begomovirus infection. Evidence for functional conserva- 
tion between NIK1 orthologs has been previously pro- 
vided with the demonstration that NIK1 from Arabidopsis 
is capable of protecting tomato plants against tomato- 
infecting begomovirus [70]. Collectively, our results indi- 
cate that NIK orthologs retain similar functions as defense 
receptors to protect plant cells against viral attack. There- 
fore, NIK-mediated antiviral signaling likely also operates 
in tomato, suggesting that the tomato NIKs may be good 
candidate targets for engineering resistance against 
tomato-infecting begomoviruses. 
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Methods 

Identification and classification of tomato RLKs 

Tomato RLK proteins were retrieved through a batch 
BLAST analysis (blastp, e-value cutoff = 0.01) [71] using 
an A. thaliana representative of each subfamily of the 
RLK superfamily against a protein database of tomato 
(iTAGv2.3) available on the Sol Genomics Network web- 
site (solgenomics.net) [72]. Through this procedure, 955 
predicted proteins were retrieved and annotated using 
SMART (smart.embl-heidelberg.de) [46] and Pfam (pfam. 
sanger.ac.uk) [47] databases. Among these proteins, 951 
contained a predicted kinase domain and hence were con- 
sidered to be putative RLKs. The sequences of the kinase 
domains of Arabidopsis RLKs, previously described in [5], 
and tomato putative RLKs were submitted to sequence 
alignment and tree reconstruction using ClustalW 
(v. 2.0.12) [73] and FastTree (v. 2.1.4) [74], respectively 
(Figure 1 and Additional file 2) using default parameters. 
The kinase domain of other kinase protein families from 
A thaliana and human were used as outgroups [3,75]. 
The accession numbers for all outgroup members are 
reported in Additional file 1. Those proteins that clustered 
with outgroup members were not considered to be RLKs 
and were discarded from further analysis. Additionally, 
short putative RLKs were deleted manually from the ana- 
lysis. The identified RLK-related tomato sequences com- 
prised a list of 647 members. Tomato RLKs that clustered 
with A. thaliana RLK subfamily members, as defined pre- 
viously in [5], were classified as members of the same sub- 
family. Phylogenetic trees (Figure 1 and Additional file 2) 
and protein schemes (Additional file 2) were generated 
using iTOL tool (itol.embl.de) [76]. 

Inference on duplication/deletion events, identification of 
RLKs in tandem repeats and functional categorization of 
RLKs subfamilies 

Number of members in the common ancestral of Arabi- 
dopsis and tomato and the occurrence of gene duplication 
and deletion were inferred by reconciliation methods 
implemented in Notung (v.2.6) [48]. For this analysis, we 
used the RLK superfamily tree, showed in Figure 1 and in 
Additional file 2, as gene tree. The species tree was gener- 
ated at NCBI taxonomy browser (www.ncbi.nlm.nih.gov/ 
Taxonomy/ CommonTree/wwwcmtcgi). For identification 
of RLKs in tandem repeats, we considered that two genes 
are clustered in tandem array when i) they are classified in 
the same subfamily, ii) they are distant from each other by 
less than lOOkb and iii) they are separated by less than 10 
genes from each other, as previously described in [34]. For 
identification of defense- or development-related genes, we 
used the GO terms associated to the Arabidopsis genes. 
Arabidopsis RLKs that had GO terms related to "response 
to stress" (GO:0006950) and/or "developmental process" 



(GO:0032502) and their child terms were classified as 
defense- and/or developmental-related, respectively. 

Statistical test for expansion/reduction analysis and 
functional categorization of RLK subfamilies 

To statistically verify if (i) RLK subfamilies have differen- 
tially expanded or reduced in their size, (ii) tandem dupli- 
cations or (iii) a functional annotation (defense or 
development) are more often in a specific RLK subfamily, 
we used the test of equal or given proportions [77]. This 
statistical analysis tests if two different proportions (pi and 
p 2 ) are equal (H 0 :p! = p 2 ) or different (H a : p 1 * p 2 ). The 
two tested proportions were the occurrence of a given fea- 
ture (number of duplication/deletion, tandem repeats or 
genes annotated as defense- or developmental-related) in 
a subpopulation (RLK subfamily, p^, and the proportion 
of the number of the same feature in the whole population 
(RLK superfamily, p 2 ). As we analyzed whether those fea- 
tures were overrepresented in a given RLK subfamily, our 
alternative hypothesis was H a : pi > p2. Test calculations 
were performed in R environment. All p-values associated 
with tested values are summarized in Additional file 4. 

Motif prediction, genomic structure and phylogenetic 
analysis of the LRRII subfamily 

Full-length amino acid sequences of members of the 
LRRII subfamily from tomato and Arabidopsis were 
aligned using ClustalW (v. 2.0.12) [73] using the default 
parameters. A phylogenetic tree was constructed using 
the maximum likelihood method (JTT model, bootstrap 
replicates = 1000) implemented in MEGA5 software 
[78]. Motif, signal peptide and transmembrane predic- 
tion were carried out using Pfam [47] and SMART [46] 
databases. The genomic structure of the LRRII subfamily 
members of tomato and Arabidopsis was determined by 
aligning the coding sequence (CDS) of each gene with 
genomic sequences of the respective organism. The 
alignment was carried out using the BLAST algorithm 
(blastn) [71] with high-stringency parameters. Amino 
acid, CDS and genomic sequences for tomato and Arabi- 
dopsis were retrieved from the Sol Genomics Network 
(solgenomics.net) [72] and TAIR (www.arabidopsis.org) 
[79] websites, respectively. 

Protein-protein interaction assays 

The analysis of protein-protein interactions between 
viral NSP and the kinase domain of tomato RLKs was 
performed using the Proquest Yeast Two-Hybrid sys- 
tem with Gateway Technology (Invitrogen Inc.). The to- 
mato RLKs that presented the highest identity with AtNIKl 
(At5gl6000), AtNIK2 (At3g25560), AtNIK3 (Atlg60800), 
AtBAKl (At4g33430) and AtNsAK (At5g24550) were 
selected for the assay. These tomato proteins are referred 
to as S1NIK1 (Solyc02g089550), S1NIK2 (Solyc04g005910), 
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S1NIK3 (Solyc04g039730), S1BAK1 (Solycl0g047140) and 
SINsAK (Solycl2g007110). As a negative control, we used 
the kinase domain of the tomato RLK that displayed the 
highest identity with AtEFR (At5g20480), referred to as 
S1EFR (Solyc03g019980). 

The NSP coding region was amplified from ToYSV 
(Tomato Yellow Spot Virus-Geminiviridae, Begomovirus) 
[80] using gene-specific primers with appropriate exten- 
sions for cloning via the Gateway system, as described in 
Additional file 7. The amplified fragment was cloned into 
pDONR201 to generate pUFV1780.1 and then transferred 
by recombination to pDEST22 yielding pUFV1781, also 
designated as TYNSP-p22. 

For amplification of the C-terminal kinase domain of 
the tomato RLKs, we prepared cDNA from cotyledons of 
wild- type tomato plants (var. Santa Clara). Briefly, total 
RNA from tomato cotyledons was isolated using an 
RNeasy Kit (Qiagen Inc.). First-strand cDNA was synthe- 
sized from 1 \ig of total RNA using the M-MLV Reverse 
Transcriptase (Invitrogen Inc.) according to the manufac- 
turer s instructions. Primers used in the amplification step 
were designed with recombination sites for further cloning 
procedures using the Gateway System (Invitrogen Inc.). 
The primers used are listed in Additional file 7. PCR 
assays were performed using Platinum Taq DNA Polymer- 
ase High Fidelity (Invitrogen Inc.) according to the manu- 
facturer s instructions. The amplified fragments were 
cloned into the entry vector pDONR201 (Invitrogen Inc.) 
and sequenced. The resulting vectors were the follow- 
ing: pUFV1756.1, pUFV1596, pUFV1757.1, pUFV1734.2, 
pUFV1744.1 and pUFV1955.2, corresponding, respectively 
to the fragment encoding the kinase domain of S1NIK1, 
S1NIK2, S1NIK3, S1BAK1, SINsAK and S1EFR. Then, the 
cloned fragment in pDONR201 was transferred to 
pDEST32, which contains the DNA-binding domain of the 
GAL4 promoter (Invitrogen Inc.). This procedure resulted 
in the following recombinant plasmids: pUFV1768.1, 
pUFV1760.1, pUFV1779.1, pUFV1769.1, pUFV1770.1 and 
pUFV1975.1, also designated as SlNIKl-p32, SlNIK2-p32, 
SlNIK3-p32, SlBAKl-p32, SlNsAK-p32 and SlEFR-p32, 
respectively. 

Competent cells of yeast strain AH 109 (Clontech 
Inc., genotype: MATa, trpl-901, leu2-3,112, ura3-52, his 
3200, gal4A, gal80A, LYS2 \\GAL1 UAS -GAL1 TATA -HIS3, 
GAL2 UA s - GAL2 tat a -ADE2, URA3 : MEL1 UAS -MEL1 TATA - 
lacZ) were sequentially co-transformed with TYNSP-p22 
and with one of the pDEST32 constructs. Co-transformed 
yeasts were plated onto synthetic dropout medium lacking 
leucine, tryptophan and histidine, and incubated at 28°C 
Yeast growth was monitored for 5 days. 

Expression analysis of the LRRII subfamily genes 

The expression patterns of genes in the LRRII subfamily 
were assayed by quantitative Real-Time PCR (qRT-PCR) 



in various tomato tissues. Wild-type tomato plants (var. 
Santa Clara) were cultivated in a greenhouse for 45 days 
after germination. Leafs, stems, roots and flowers from 
three plants were collected separately. We also cultivated 
plants in half-strength Murashige and Skoog medium 
(1/2 MS, Sigma- Aldrich Co.) for 10 days after germin- 
ation under normal conditions to collect cotyledons and 
hypocotyls tissue. For these tissues, due to the small 
amount of material, each sample represented a pool of 
three young plants. Total RNA from each sample was 
extracted using TRIzol (Invitrogen Inc.), and the quality 
and integrity of extracted RNA were monitored by 
spectrophotometry and electrophoresis. For cDNA syn- 
thesis, 3 \xg of total RNA from each sample was first 
treated with RNase-free DNAse I (Promega Inc.) and 
then reverse-transcribed using M-MLV Reverse Tran- 
scriptase (Invitrogen Inc.) and oligo-dT primers. qRT- 
PCR assays were performed using an ABI7500 Real Time 
PCR System (Applied Biosystems Inc.) and SYBR® Green 
PCR Master Mix (Applied Biosystems Inc.). The amplifi- 
cation reactions were performed using default para- 
meters for thermal cycling (50° for 10 min, 95° for 1 
min, followed by 40 cycles of 95° for 15 sec and 60° for 1 
min). Primers were designed using PerlPrimer [81], 
attempting to choose primer pairs in which at least one 
of them extended across an intron-exon boundary. 
Expression quantification of each gene was determined 
according to the Ct relative quantification method 
(2" ACt ) [82] using SIAPT1 (adenine phosphoribosyl 
transferase, Solyc04g077970.2.1) as an endogenous con- 
trol for data normalization. Expression data from Arabi- 
dopsis were obtained from the AtGenExpress website 
(jsp.weigelworld.org/expviz/expviz.jsp) [55] . 

Additional files 



Additional file 1: List of outgroup proteins. Summary of the names 
and accession numbers of proteins used as outgroups in the 
phylogenetic tree of Figure 1 and Additional file 2. 

Additional file 2: RLK Phylogenetic tree of tomato and Arabidopsis. 

This is the same phylogenetic tree as presented in Figure 1, but 
displayed in more details. It contains additionally the accession numbers 
and schemes of the domain structures of each protein that composes 
the tree. Tomato proteins are represented by red branches and 
Arabidopsis proteins by blue branches. The local support values at the 
nodes were computed by resampling the site likelihoods 1,000 times and 
performing the Shimodaira-Hasegawa test. 

Additional file 3: Expansion/reduction in Arabidopsis and tomato 
RLK subfamilies and functional inference. The membership size of RLK 
subfamilies in Arabidopsis (At) and tomato (SI) is indicated . Values in 
bold and with asterisks indicate statistical significance by the test of 
equal or given proportions (a=0.05). Subfamilies with significantly large 
proportion of duplication (dup.) or deletion (del.) were considered to 
have specifically expanded or reduced respectively after the divergence 
of Arabidopsis and tomato species. Subfamilies that presented statistically 
large proportion of RLKs organized in tandem repeats (t.r.) and/or of RLKs 
functionally annotated in defense response (def.) category were 
considered to be defense-related (red arrows). Conversely, subfamilies 
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with significantly large proportion of members annotated in 
developmental process (dev.) category were classified as development- 
related (blue arrows). Green arrow indicates the LRRII subfamily that 
presented large proportion in both functional categories. Legend: dup.: 
duplication events; del.: deletion events. 

Additional file 4: Analyses on expansion/reduction in Arabidopsis 
and tomato RLK subfamilies and on their functional inference. The 

table contains information from Additional file 3 and presents the 
associated p-value from each test performed. 

Additional file 5: List of Arabidopsis and tomato RLKs and their 
respective RLK subfamilies. Summary of all RLK IDs presented in the 
tree of Additional file 2. 

Additional file 6: Sequence identity between members of LRRII-RLK 
subfamily of tomato and Arabidopsis. (A) Full length amino acid 
sequences, (B) intracellular and (C) extracellular regions of LRRII subfamily 
members were aligned using CLUSTALW. Thick lines delimit the 
sequence comparison between members of the same clade (NIK, SERK, 
LRRIIc). Blue cells indicate high sequence identity, whereas red cells 
denote low sequence identity. 

Additional file 7: List of primers used for yeast two-hybrid assay 
and for expression analysis by real-time PCR analysis. Summary of all 
primers used for gene cloning and real-time PCR experiments. 
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